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Abstract 

This work proposes and study the concept of Functional Data Anal- 
^— H ysis transform, applying it to the performance improving of volumetric 

r- H Bouligand-Minkowski fractal descriptors. The proposed transform con- 

sists essentially in changing the descriptors originally denned in the space 
* of the calculus of fractal dimension into the space of coefficients used in 

the functional data representation of these descriptors. The transformed 
decriptors are used here in texture classification problems. The enhance- 
H ment provided by the FDA transform is measured by comparing the trans- 

it* formed to the original descriptors in terms of the correctness rate in the 

. ^ classification of well known datasets. 

o3 1 Introduction 

In recent years, the literature has presented a lot of applications of fractal theory 
to the solution of problems from distinct areas. As examples we may cite appli- 
cations in Botany |TJ[2j[3], Medicine HG3E3 and Geology [HHEI. Particularly, 
in Physics, we may find applications of fractal theory in Optics [TUJ [TTJ H2] . 
Materials Science p~3j [14j [15] and Electromagnetism p~6j [17j [18] , among many 
other areas. Such large amount of works exploring tools from fractal theory is 
fully justified by an interesting observation already pointed out in [15] . This ob- 
servation states that systems observed in the nature generally may be modelled 
by fractal measures rather than by classical formalisms. 
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Among the applications of fractal theory, most of them aim at using the 
fractal modeling in order to extract features from objects of interest according 
to the problem domain, like textures, contours, surfaces, etc. Such features 
are then provided as input data, for example, to methods for segmentation, 
classification and description of objects. A classical example of such fractal 
feature is the fractal dimension. 

As in the most of cases the simple use of fractal dimension is still not sufficient 
to well represent the complexity of an object or scenario from the real world, the 
literature developed techniques for the extraction of a set of features based on 
the fractal dimension. Examples of such approaches are Multifractal theory [201 
[2TJ[22], Multiscale Fractal Dimension (MFD) [23J [24] and Fractal Descriptors 
Hi [271 HE]. 

Here, we are focused on fractal descriptors approach. Several authors, like 
in [27|, 25 , 26 , 28], obtained interesting results in different applications of fractal 
descriptors technique to texture and shape analysis, mainly in the description 
of natural objects. Particularly, here we are focused on an approach developed 
in [26] which uses the volumetric Bouligand-Minkowski fractal dimension to 
generate a set of descriptors. Such descriptors obtained a high performance in 
an application to a task of plant leaves classification based on texture. 

Nevertheless, an important drawback of fractal descriptors technique, partic- 
ularly that based on Bouligand-Minkowski, is that the curve formed by the set of 
descriptors present a high correlation, that is, each descriptor is strongly depen- 
dent on each other. This correlation does their performance decrease drastically 
in problems of classification and segmentation with a high number of samples 
and classes. In such situations, volumetric Bouligand-Minkowski descriptors 
have severe limitations. 

Aimimg at enhancing Bouligand-Minkowski descriptors, preserving the reli- 
ability of the results, this work proposes the development and use of Functional 
Data Analysis (FDA) transform concept. Functional Data Analysis is a powerful 
statistical tool developed in [25] . It represents an alternative to the traditional 
multivariate approach and deals with complex data as being a simple analytical 
function: the functional data. FDA approach presents certain advantages in 
this kind of application, like the easy handling of data in nonlinear domains (as 
the case in Bouligand-Minkowski descriptors) and the intuitive notion of func- 
tional operations, like derivatives and smoothing, employed in the definition of 
fractal descriptors. 

Up our knowledge, Florindo et al. [28] is the first work to apply the FDA 
approach to fractal descriptors. In that work, functional data representation is 
used for reducing the dimensionality of the descriptors set in shape recognition 
problems. Here, we propose a different paradigm for FDA use, by defining the 
concept of FDA transform. The FDA transform is defined as the operation 
which changes the original data set (in this case, descriptors) space into the 
space of coefficients of functional data. The transform still presents two variants: 
the first uses the coefficient directly, the second performs a second algebraic 
transform, described in [30] . 

The relevance of the FDA transform is verified in experiments of classification 
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of two well known datasets, that is, Brodatz [31] and OuTex [32]. The results 
are compared in terms of classification correctness rate. It was considered two 
variants of the FDA transform and it was compared through three classifiers very 
well known in the literature: Linear Discriminant Analysis (LDA), K-Nearest 
Neighbors (KNN) and Bayesian [33 ] 134 ] 135] . 

This work is divided into seven sections, including this introduction. The 
following explains the concepts of fractal theory, fractal dimension and fractal 
descriptors. The third introduces the Functional Data Analysis theory and 
definitions. The fourth shows the proposed method. The fifth describes the 
experiments. The sixth section shows the results and the last section concludes 
the work. 

2 Fractal Analysis 

The literature shows a lot of applications of fractal geometry involving the 
characterization of natural objects and scenarios. Examples of such applications 
may be found in[10][ia[TS[a[8][IIl[T5][18]. Most of these works use the fractal 
dimension as a metric for describe the object. This strategy is justified by the 
fact that fractal dimension measures the complexity of a structure. Physically, 
the complexity corresponds to the irregularity or to the spatial occupation. 
These properties are tightly related to constitution aspects which allow the 
identification of such objects. 

An important drawback of using only fractal dimension is that it is a unique 
global value and is not capable of extract information about intricate details 
of a structure. With the aim of exploring fully the potential of fractal theory, 
the literature shows the development of techniques which provide not only a 
unique value but a set of values capable of describing in a richer way an object, 
based on the fractal theory. Among these techniques, we have the Multifractal 
[20] HU [22], the Multiscale Fractal Dimension [23] [ll] and the Fractal Descriptors 
[25] [26] E3 [28]. 

Multifractal theory replaces the fractal dimension analysis by the concept of 
fractal spectrum, capable of modeling objects which cannot be represented by 
a single fractal measure. Multifractal demonstrates to be an interesting tool to 
capture the different power-law scaling present in a system [20] [21] [22] . 

The literature still shows an alternative technique for the modeling of objects 
with fractal theory. This approach is the Multiscale Fractal Dimension (MFD) 
[23] [24]. In MFD approach, instead of simply calculate the fractal dimension 
from interest objects, a set of features is extracted from the derivative of the 
whole power-law curve used to provide the fractal dimension. 

An extension of MFD are the fractal descriptors [25] [26] [27] [28]. In this 
case, we extract features (descriptors) from an object through the calculus of 
the fractal dimension taking the object under different observation scales. These 
descriptors are used to compose a feature vector that could be mean as a "signa- 
ture" to characterize the object. Particularly, fractal descriptors demonstrate to 
be an efficient tool for the discrimination of natural textures like that analyzed 



3 



in the present work. The Figure [I] illustrates the discrimination power of fractal 
descriptors D(k) by showing two distinct textures whose fractal dimensions are 
identical but the curve of fractal descriptors is visually distinct. 



Texture 1 Texture 2 




20 40 60 80 

r 

(b) 

Figure 1: Two textures with the same fractal dimension present fractal descrip- 
tors totally different, (a) Original textures (both with fractal dimension 2.618). 
(b) Fractal descriptors from the same textures. 

The following sections describe in more details the aspects involved in fractal 
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descriptors technique, starting from the fractal dimension definition. 



2.1 Fractal Dimension 

Fractal dimension is a real positive number constituting the main measure ex- 
tracted from a fractal object. There is no absolute definition for the concept of 
fractal dimension. The most used and classical one is the Hausdorff-Besicovitch 
dimension. 

Hausdorff-Besicovitch dimension dim#(F) is a concept derived from the 
measure theory and is defined over a set F C 3? n as 

dim H (F) = {s}|inf {s : H 8 (F) = 0} = sup{H s (F) = 00} , (1) 

where H S (F) is the s— dimensional Hausdorff-Besicovitch measure, defined by 

H'(F) = limff'(F), (2) 

e— »0 

where 

H s e (F) = inf |^ \Ui\ s : U t is an e-cover of F j . (3) 

In above equations, || expresses for the diameter in 3? n , that is, 
\U\ = sup \x-y\:x,y eU. 

In many situations, the calculus of Hausdorff-Besicovitch dimension is very 
complex and even impracticable. In such cases, we can calculate it by general- 
izing the concept of classical Euclidean dimension [19] , In this way, we obtain 
the following expression 

dim ff (F) = lim^l, (4) 

e^O log(^) 

where N(e) is the minimum number of objects with linear size e needed to cover 
F [36]. 

Most of different definitions of fractal dimension are based on a generalization 
of Equation |4j expressed through 

0=Bm !f«l, (5) 

5^0 log(<S) 

where M is a set measure depending on the specific fractal dimension method 
and 5 is the scale parameter. As example of fractal dimensions defined from the 
previous expression we can cite the box-counting, the packing dimension, the 
Renyi dimension, etc [36] . 

Particularly, here we are focused on the Bouligand-Minkowski fractal dimen- 
sion [36]. As the Hausdorff-Besicovitch dimension, the Bouligand-Minkowski 
dimension also is based on a topological measure, in this case, the Bouligand- 
Minkowski measure measM calculated through 

(w Q x r V(dF®rS) ... 
measM {F,S,r) = hm — , (6) 

r_^0 j.n r 
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where F is the object (set) of interest, S is a structuring element with radius r 
and V is the volume of the dilation between S and the boudary dC of C. The 
Bouligand-Minkowski dimension itself is given by 

diniM(^, S) = inf {r, measM(F, 5, r) = 0} . (7) 

For an application to discrete objects represented in a digital image, the 
calculus is significantly simplified through the use of neighborhood techniques. 
In this way, the above expression becomes 

in which S is a disk with diameter e (also called dilation radius), C is the number 
of points pertaining to the dilation region F © S e and N is the topological 
dimension of the space in which F is immersed. 



2.2 Multiscale Fractal Dimension 

Although fractal dimension is an important measure, it is insufficient for a good 
representation of complex systems which present different fractal dimension de- 
pending on the observation scale taken into account. In order to provide a 
richer fractal-based information from an object, the literature shows the Multi- 
scale Fractal Dimension (MFD) [231124]. 

MFD consists in the application of a multiscale transform to the fractal 
dimension. The multiscale transform of a signal u(t) is the function [/(&, a), 
where b is directly associated with t and a is the scale variable. Essentially, the 
multiscale is performed through three approaches: scale-space, time- frequency 
and time-scale. In the following, we describe the approach used in MFD, e.g., 
scale-space. More details are found in [24] . 

Scale-space is a particular case of multiscale transform. It is based on the 
derivative of the signal followed by a convolution with a smoothing gaussian 
filter [37]: 

{(b,a)\a,b€ft,a>0,b£{U 1 (t,a)} z c}, 

where . z c expresses the zero-crossings . and f/ 1 ^, a) represents the convolution 
of the original signal u(t) with the first derivative of the gaussian g*, that is: 

U 1 (t,a)=u(t)*gl(t). 

In [23], the MFD is obtained from the Bouligand-Minkowski fractal dimen- 
sion in the following manner: 

d(log(A(r))) 

MFD a = 2 - K J K \ JJJ * g a , 9 
d(log(r)) 

where A(r) is the dilation area for each dilation radius r. In MFD technique, 
some characteristics of MFD curve, like maximum, minimum and area below 
the curve graph, are extracted to compose a feature vector for the analyzed 
object. 
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2.3 Fractal Descriptors 



Fractal descriptors [25j [26] [271 US are an extension of MFD concept where a 
feature vector is extracted from the fractal dimension calculated over a whole 
interval of scales. 

Generally speaking, fractal descriptors are obtained from the function u: 

w.\og(e) ->log(M(e)), 

where M is a measure depending on the fractal dimension estimation method 
and e is the scale parameter. 

The function u must be used directly, as in [26], or may be summited to a 
particular transform. For instance, in [25], the descriptors £) are extracted from 
the Fourier derivative of u: 



du 



%-\D(f)U(f)), 



where t is equivalent to log(e), U is the Fourier transform of u and D is Fourier 
derivative: 

D(f) = J27TU, 

where j is the imaginary number. In order to attenuate noises inherent to 
the derivative operation, one may still apply a convolution with a gaussian 
filter embedded in the Fourier derivative, as employed in [28]. Thus, the above 
expression becomes: 

Z> = f =T-HD(f)U(f)Gl), 

where G\ is the derivative of the gaussian g a in the Fourier domain. The Figure 
[2] shows the aspect of descriptors curve of an object. 





200 400 



(b) 



(c) 



Figure 2: Fractal descriptors curve, (a) Object analyzed, (b) Curve u(t). (c) 
Descriptors curve after Fourier derivative, followed by gaussian convolution. 

In [27], the descriptors are obtained from the Fourier derivative, followed by 
a Principal Component Analysis (PCA) transform, aiming at reduce correlation 
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among descriptors. In this way, a more reliable and consistent set of descriptors 
are provided to characterize plant leaf shapes analyzed in that work. 

Here, we propose the application of Functional Data Analysis, described in 
the following, as a transform to u, in order to generate more robust and precise 
fractal descriptors. 

2.4 Volumetric Bouligand-Minkowski Fractal Descriptors 

In this work we focus on a specific fractal descriptors approach developed in [26] 
called Volumetric Bouligand-Minkowski Fractal Descriptors (VBFD). The main 
idea is the calculus of Bouligand-Minkowski fractal dimension of a 3D surface 
taken under a range of observation scales. These descriptors are employed to 
describe texture images, that is, analysis of images based on spatial and color 
arrangement of pixels. 

In the first step, we map the intensity image Img E [1 : M] x [1 : N] —> 5ft 
onto a threedimensional surface 

Surf = {i,j,f(i,j)\(i,j) E [1 : M] x [1 : N]}, (10) 

such that 

f(hj) = {l,2,...,max-gray}\f = Img(iJ), (11) 

where max_gray is the maximum pixel intensity. This transform is illustrated 
in the Figure [3] 




(a) (b) 

Figure 3: Texture mapped onto a surface, (a) Original texture, (b) Threedi- 
mensional surface. 

In the following, each point of the surface is dilated by a sphere with vari- 
able radius r, like illustrated in the Figure [4j Finally, we analyze the dilation 
volume V(r), that is the number of points inside the structure composed by 
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the dilation with each radius r. V(r) also corresponds to the number of points 
with a distance at most r from the object. Thus, the exact Euclidean Distance 
Transform (EDT) [38 becomes an efficient tool for this calculus. 




(c) (d) 



Figure 4: Dilated surfaces with different radii, (a) Points from original surface, 
(b) Radius 2. (c) Radius 5. (d) Radius 10. 

In 3D space, EDT is defined as the distance of each point in the space to a 
subset of it. In our case, this subset is the surface and the EDT for each point 
outside Surf is given by 

EDT(p) = min{d(p, q)\q € Surf}, (12) 

in which d is the Euclidean distance. 

Exact EDT is characterized by the fact that distances present discrete values 
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E 

E = 0,l,V2,...,l,..., (13) 

where 

leD = {d\d=(f + j 2 ) 1 / 2 ;i,jeN} (14) 

The dilation volume is provided by 



i=l 

where 



V{r) = Y J QH), (15) 

Q(r) = (x,y,z)\g k (P) - [g r (P) n U r ^ gi (P)}, (16) 

such that 

( (x,y,z)\[(x-P x ) 2 + (y-P y ) 2 \ 

9An-{ +iz -P z f ] V2 = E(r);x,y,zeN /' (i7) 

where 

P=(x,y,z)\f(x,y,z)eSurf (18) 

The Bouligand-Minkowski fractal dimension FD of the surface is simply 
given through 

FD = 3-a (19) 

where a is the slope of a straight line fit to the curve log(V(r) x log(r)). The 
technique in [26] went beyond the simple fractal dimension calculus and uses 
all values log(V(r)) as descriptors for texture: the VBFD descriptors. Notice 
that log(V(r)) is directly related to the Bouligand-Minkowski dimension for 
maximum radius r. Moreover, each radius corresponds to an observation scale, 
from further (greater radius) to closer (smaller radius). 

In that work, they employed VBFD in the discrimination of plant leaves 
images, achieving excellent results. The Figure [5] shows the capability of MFD 
curve in discriminating two textures from different materials. 

Here, we propose the use of Functional Data Analysis [29] in order to enhance 
the performance of volumetric Bouligand-Minkowski descriptors. 



3 Functional Data Analysis 

Functional Data Analysis (FDA) [29, 39 is a statistical tool alternative to mul- 
tivariate analysis. While in multivariate statistic, we are interested in relations 
among observations of variables, in FDA, each observation of a set of variables 
is handled as a unique analytical function. Thus we extract measures from those 
functions, like derivatives, curvature, etc. In FDA terminology, each observation 
is called "data" and the function is called "functional data" . 

FDA has found applications in Economy, Biology, Meteorology, etc. like syn- 
thesized in [39]. The functional representation has some noticeable advantages 
in practical applications. For example, we simplify the global analysis of an 
observation with missing values, irregular sampling domain or noise. Besides, 



10 




11 



the analytical representation allows the application of operations like derivatives 
of different orders, curvatures, integrals, etc. These operations turn possible, 
for example, a more accurate analysis of the variability level of the data among 
other important characteristics. 

The main result of FDA theory attests that any observation of statistical 
variables with analytically smooth behavior may be perfectly represented by 
some analytical function. In practice, this function is not exactly known but 
we may obtain efficient approximations. The most common strategy for the 
obtainment of such approximations is the interpolation of the data through 
specific basis functions, like Fourier, wavelets, polynomials. Here we choose the 
use of B-splines functions, due to their flexibility in data representation. This 
process is also known as basis function development. 

For the computational statistical analysis we must represent the analytical 
function numerically in some manner. Then, the coefficients of basis functions 
in the interpolation are used as the effective functional representation and sta- 
tistical metrics, like mean and variance, are calculated from these coefficients. 
Here we also use a more complex representation in which the coefficients are 
previously summited to an algebraic transform. The Figure [6] illustrates FDA 
representation. 



Original data 




Coefficients 
a1,a2,a3,...an 



Figure 6: Functional data representation. From above, the curve of an original 
measure, the basis functions and the coefficients used to represent computation- 
ally the functional data. 
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3.1 Mathematical Foundations 

For computational purposes, the data which must be analyzed consists in the 
observation of discrete pairs of variables u = (xj, y l j)i<j<m\i<i<m m which 
x l - G V, y l j G 5ft are the pair values in the variable i and m l is the number 
of observed pairs for each variable. For example, an observation may be the 
measure of temperature along a day and each pair may correspond to the hour 
in which the temperature is measured and the respective temperature value. 

The fact is that in this hypothetical example, the temperature cannot be 
perfectly modelled by any continous function, capable of predicting exactly the 
temperature for any day and hour. In order to apply the FDA concept to such 
data, we need to use a very important result from FDA theory which affirms 
that a data capable of being analyzed through FDA may be represented by an 
analytical function if we take into consideration a tolerable approximation noise. 
The result demonstrates that we can find an analytical function u l such that 

yj = «'(.>•}) (20) 

where measures the noise inherent to the acquisition process. 

Although the function u is not known explicitly, some classical function 
interpolation techniques have been applied to the observation pairs yielding an 
approximation of u. Generally, this approximation method is based on the 
development of u into functions basis. This technique consists in the projection 
of approximating function u % in a subspace with q linearly independent basis 
functions {(j)i)\<i< q . In this way, the approximating function is represented by 
u l = Y^=i a j( uZ )<t>j- The values of <x/(V ) are called the projection coefficients 
and are calculated as those which minimizes 

m 1 q 

^2iy l j -^ a k(u l ')<l>k(x))} 2 ,i = l,...,n. (21) 

j=l k=l 

Particularly, in this work, the functions basis choosen were the B-splines, which 
already demonstrated good results in the data here analyzed as shown in [28] . 
In the following, we describe briefly the B-splines concept. 

3.2 B-splines 

B-spline is a particular kind of spline function. A spline is a real function 
composed by piecewise polynomial functions. 

More formally speaking, a spline may be defined as 

s : [a, b] 5ft. (22) 

The interval [a, b] is divided into n "knots" k as 

a = ki < k 2 < ... < k n = b. (23) 
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In each subinterval the spline s is given by the polynomial Pi. The 

order of the spline corresponds to the highest order of polynomials. Each poly- 
nomial Pi is called a basis of the spline function. 

A B-spline is a particular category of splines characterized by minimum 
support (number of points where the function has value different of zero) . Each 
B-spline basis Bij is defined through 

B i0 {t) = { l,if*i<*<*i+i*Ki*i<*i + i 
*' uv 7 \ 0, otherwise. v 7 

B id (t) = -^-B^t) + Wi-* Bi+1 ._ l{t) (25 ) 

H+j ~ H H+j+l ~ H+l 

Finally, the B-spline curve is given by 

n 

B(t) = Y,LiB ilP (t), (26) 

i=0 

where p is the degree of the basis and Li corresponds to the knots. 



4 Proposed Method 

Beyond its importance as a statistical analysis tool, FDA has demonstrated to 
be an efficient technique to extract relevant information from a large data set. 
For example, in [40], a large amount of data respect to the water quality is 
collected in a specific local. Thus, the FDA approximating function is obtained 
from each curve of observed values and coefficients otj{u l ) are used to extract 
important characteristics from data. 

In [28], FDA is used to reduce the dimensionality and extract utile infor- 
mation from fractal descriptors used in a task of shape analysis. In that case, 
instead of using directly aj(u l ) coefficients, it is employed a transform of that 
coefficients which takes into account the contribution of the function basis space 
used. 

This transform is performed by the canonical transform matrix <l> 

*(fe,Z)=<0 fc ,^>, (27) 

where <ft are the basis functions. Besides, to simplify the notation, we use 
a(u) = (cei(ix), a q (u)), corresponding to the set of coefficients from the q 
basis functions. Thus, the transformed coefficients j3{u) are given through 

P(u)=Sa(u), (28) 

where S is the result from the matrix $ decomposed by the Choleski method 
[30] . that is, S is a unique lower triangular decomposition matrix of such 
that & = SS*, where S* is the conjugate transpose of S. 



14 




(a) (b) (c) 



Figure 7: Discriminative property of FDA. (a) Original data curve, (b) FDA 
coefficients, (c) Transformed coefficients. 

The Figure [7] shows the discriminative power of FDA. We observe a data 
represented in two curves with similar visual aspect and the discrimination of 
FDA coefficients without and with transform. 

The present work extends the application in [28 to the analysis of volu- 
metric Bouligand-Minkowski descriptors, applied to texture classification. The 
motivation for this application comes from the fact that volumetric Bouligand- 
Minkowski descriptors corresponds to a typical case of data whose FDA repre- 
sentation is interesting, according to [29j|39]. In fact, the descriptors present 
a global smooth aspect, being therefore analytical. Besides, they are extracted 
from a nonlinear space (log-log curve) and then are provided by a domain ir- 
regularly spaced. Another motivation is the fact that fractal descriptors may 
involve a derivative operation which becomes more intuitive by the handling of 
descriptors as a function and not only as a simple set of non related values. 

Unlike the situation in [28], the objective of using FDA here is not the simple 
dimensionality reduction, even because volumetric descriptors are easily treated 
by traditional classification methods. A problem with volumetric descriptors 
is that, although they allow for the achievemet of good results, they present a 
high level of correlation, that is, the original descriptors have a high dependence 
among themselves. This fact implies in difficulties for discrimination tasks en- 
volving a large number of samples and classes. Our purpose is to evident iate 
patterns in the global structure of descriptors which turn possible the enhance- 
ment of the discrimination power of original volumetric descriptors. 

For this goal, we propose the use of direct coefficients a(u) or transformed 
coefficients /3(u) replacing conventional fractal descriptors in a classification 
method. We call this representation form as FDA transform. In fact, we have 
a typical transform, in which the data is mapped from the log-log space of 
volumetric Bouligand-Minkowski descriptors onto the space of coefficients of 
functional data. The Figure [5] illustrates the FDA transform steps. 
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Figure 8: Functional data representation. A diagram illustrating the steps in 
the FDA transform applied to fractal descriptors. 



5 Experiments 

The performance of the FDA transform on volumetric Bouligand-Minkowski 
fractal descriptors is tested in an application to the classification of textures 
from two different datasets. 

The first is the classical Brodatz texture dataset [31 , composed by 111 
classes, each one with 10 samples (images) corresponding to photographies of 
real world textures. The second analyzed dataset is the also well known Outex 
dataset [32], composed by 68 classes with 20 images in each class. 

The steps in the experiments may be summarized through the following 
items: 

1. Extraction of Volumetric Bouligand-Minkowski descriptors from each im- 
age in the dataset; 

2. Computation of a coefficients of the approximating analytical function 
(functional data); 

3. Obtainment of the coefficients f3 after the transform described in the Sec- 
tion |4| 

4. Use of coefficients a and j3 as input to different classification methods; 

5. Comparison, in terms of classification performance, among the proposed 
approach and the direct use of volumetric Bouligand-Minkowski descrip- 
tors. 

The performance of the FDA transform is verified in direct approach (using 
a(u)) and transformed approach (using P(u)). The basis used was B-spline. 
For the classification process we use classical methods from the literature [33] , 
that is, Bayesian, K-Nearest Neighbor (KNN) and Linear Discriminant Analysis 
(LDA). 
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Figure 9: Some examples of images from the Brodatz dataset. 



6 Results 

The results are showed in graphs and tables which represent the different ways 
for the use of the FDA transform combined to Bouligand-Minkowksi descriptors 
in the datasets analyzed. Empirically, we found an optimal interval for the num- 
ber of descriptors used, that is, between 60 and 100 for direct FDA coefficients 
and between 10 and 50 for transformed FDA fractal descriptors. 



Firstly, the Figure 11 shows the correctness rate for the use of FDA fractal 
descriptors in the classification of Brodatz data set. At left, we show the results 
for normal FDA. At right, for transformed FDA. From above to below, we use 
Bayesian, KNN and LDA classifier. Initially, we cannot notice any direct relation 
among number of descriptors, basis order and correctness rate. The exception 
occurs with the use of LDA with transformed FDA descriptors. In this case, it 
is clear that the correctness rate increases with the number of descriptors. In 
the most of cases, however, it is noticeable that higher order basis yield greater 
correctness. This is explained by the fact that those basis are capable of capture 
more details from the conventional VBFD descriptors. Relative to the number of 
descriptors, the graph shows that each specific combination of FDA descriptors 
and classifier provides a different pattern for the correctness rate results. This 
is also waited due to the fact that each classifier has a particular way of dealing 
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Figure 10: Some examples of images from the OuTex dataset. 



with correlation and irregularity information. 

The Figure [2] shows the correctness rate in Outex data set. The graphs are 
organized in the same way as in the Figure [TT] The observations from Brodatz 
data set are also valid in this case. Particularly, an interesting observation is 
that the aspect of the graph of each combination descriptor /classifier is similar 
in both data sets. The unique significant difference is the global correctness that 
is smaller in Outex, due to its greater difficulty level when compare to Brodatz 
data set. 

Now we show the best results achieved by each combination of descriptors 
and classifiers and the number of used descriptors. In the Table [I] we show the 
correctness rate for Brodatz data set. We observe that even using a reduced 
set of descriptors, FDA achieved results sensibly more precise than classical 
VB fractal descriptors. This advantage is more notable in KNN and Bayesian 
classifier. In Bayesian, FDA presented an advantage of 42% while in KNN 
this advantage was 27%. Another important observation from the table is that 
in this specific application the use of normal FDA coefficients demonstrated 
to be the better solution. This is very encouraging since this FDA approach 
is computationally simple and allows an easy statistical interpretation of the 
analyzed data. 

Concluding, we present the results of FDA descriptors in OuTex dataset. 
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Figure 11: Correctness rate for Brodatz data set. At left, using normal FDA 
coefficients. At right, using transformed FDA descriptors. From above to below 
using Bayesian, KNN and LDA classifier. 
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Figure 12: Correctness rate for Outex data set. At left, using normal FDA 
coefficients. At right, using transformed FDA descriptors. From above to below 
using Bayesian, KNN and LDA classifier. 
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Descriptors 


Classifier 


Number of descriptors 


Correctness rate (%) 


Original 


Bayesian 


86 


40.0 ± 0.2 


Normal FDA 


Bayesian 


80 


56.8 ± 0.1 


Transformed FDA 


Bayesian 


10 


43.2 ± 0.1 


Original 


KNN 


86 


54.8 ± 0.1 


Normal FDA 


KNN 


75 


69.8 ± 0.2 


Transformed FDA 


KNN 


20 


56.8 ± 0.1 


Original 


LDA 


86 


98.6 ± 0.1 


Normal FDA 


LDA 


70 


99.0 ± 0.1 


Transformed FDA 


LDA 


50 


98.2 ± 0.0 



Table 1: Correctness rate for the use of fractal descriptors enhanced by FDA 
normal and transformed coefficients in Brodatz dataset. 

Again, the performance of FDA descriptors is very good. A highlight must be 
given to the Bayesian result. FDA provided a correctness rate 123% greater 
than classical fractal descriptors. 



Descriptors 


Classifier 


Number of descriptors 


Correctness rate (%) 


Original 


Bayesian 


86 


23.0 ± 0.1 


Normal FDA 


Bayesian 


80 


51.4 ± 0.2 


Transformed FDA 


Bayesian 


25 


24.9 ± 0.1 


Original 


KNN 


86 


47.3 ± 0.2 


Normal FDA 


KNN 


65 


57.0 ± 0.1 


Transformed FDA 


KNN 


15 


48.1 ± 0.1 


Original 


LDA 


86 


92.0 ± 0.0 


Normal FDA 


LDA 


80 


92.5 ± 0.1 


Transformed FDA 


LDA 


50 


91.8 ± 0.2 



Table 2: Correctness rate for the use of fractal descriptors enhanced by FDA 
normal and transformed coefficients in Outex dataset. 

From the previous results, we observe that we cannot extract an exact re- 
lation between the number of FDA basis (and, consequently, descriptors), the 
order of used basis and the correctness rate results. However, analyzing with- 
out excessive severity, we observe that generally the use of higher order basis 
increases the classification performance. Nevertheless, it is always important to 
verify the combinations for each different application. 

Analyzing more globally the results, we observe initially that the FDA 
transform provides a significant increasing in the performance of volumetric 
Bouligand-Minkowski descriptors, mainly when we used KNN and Bayesian 
classifier. This fact attests that the FDA transform is capable of extract rele- 
vant features from the set of descriptors, allowing for the classifiers to provide 
a more precise classification result. As discussed in the Section |4j the good per- 
formance of the FDA transform was expected due to the smooth analytical and 
irregularly spaced nature of Bouligand-Minkowski descriptors. The smaller effi- 
ciency in LDA classifier is easily explained by the fact that one step in the LDA 
method involves a correlation space transform (Principal Component Analysis). 
So, features extracted by the FDA transform do not necessarily to have the 
same correlation properties as the original descriptors and this fact prejudices 
the performance of the whole classification process. 
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7 Conclusions 



This work proposed and analyzed the use of the FDA transform aiming at en- 
hancing the performance of volumetric Bouligand- Minkowski fractal descriptors, 
applied to texture classification. The transform consists in the use of coefficients 
from the functional data representation replacing the original descriptors. 

Results demonstrated that the FDA transform increased significantly the 
accuracy of classification process, mainly when using Bayesian and KNN classi- 
fiers. Results confirmed what is expected from the theory, once FDA is a pow- 
erful statistical tool for the representation of smooth analytical data, like fractal 
descriptors. The FDA transform extracts important features and patterns from 
the original descriptors set yielding a better classification performance. 

Results suggest strongly that FDA must be considered as an auxiliary tool 
for other methods shown in the literature for obtaining fractal descriptors or 
even other techniques in texture analysis that generate a set of values which 
may be handled as an analytical function. 
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